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ON A SINGLE NUMERICAL INDEX OF THE AGE DISTRIBUTION 

OF A POPULATION 1 

By Raymond Pbarl 

Department of Biometry and Vital Statistics, Johns Hopkins University 

Read before the Academy, April 27, 1920 

It is an obvious fact that the crude death rate of any community is 
influenced in a marked degree by the age distribution of the living popula- 
tion of that community. Before any critical conclusions about the true 
force of mortality can be drawn some sort of correction must be applied 
to take account of the age distribution of the population. If one desires 
to make any analysis of the correlation of death rates with each other or 
with sundry environmental factors it is an absolutely essential prerequisite 
that there be found some single numerical expression which will be an 
approximate index of the age distribution of the population of each lo- 
cality dealt with. 

From a mathematical viewpoint the problem presented is one incapable 
of exact solution. The problem mathematically is to find a single parameter 
which will fully describe so complex a curve as that of the age distribution 
of a population, the general form of which is indicated in figure 1. But 
this is impossible. No single parameter can possibly describe fully such 
a curve. The best that can be done is to approximate as closely as may 
be to the impossible ideal. It is the purpose of this paper to describe and 
illustrate an approximation which comes very close to the requirements, 
indeed quite sufficiently so for all practical statistical purposes in all cases 
where it has yet been tested out, and is easy to calculate. 

In a recent paper, 2 I proposed as an index of differences in age compo- 
sition of populations, and applied it to 40 American cities, the expression 
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x 2 = 5^j (1) 

"where A is the deviation for each of six age groups (viz, 0-4, 5-14, 
15-24, 25-44, 45-64, 65 and over) of the percentage of the actual popula- 
tion of each city in 1910 in each age group, from the percentage in the same 
group in the Standard Population of Glover's 8 Life Table, denoted in 
the formula by P. 5 denotes summation of all six values. The value 
X 2 measures the extent to which each city deviates in the age con- 
stitution of its population from a fixed standard, but does not tell 
the nature or kind of the deviation." 

In the paper cited it was shown empirically that in spite of the obvious 
defect of this index mentioned in the last sentence, it was adequate for the 
restricted material and purposes there involved. Since that paper was 
published it has occurred to me that the outstanding defect of the former 
index will, to that degree of accuracy which is requisite for all practical 
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statistical purposes, be entirely removed by adopting as an age-consti- 
tution index the function 
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(M-M p ) 



(2) 



where S, A and P have the same significance as before, and M —. mean 
age of living population in any community, M p — mean age of persons 
in a stationary population unaffected by migration and which, assuming 
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Fig. 1. Diagram showing age distribution of population, grouped in six age classes. 
The last class in this and other diagrams is assumed to have its ending at age 95. 
The solid line gives the stationary life table population, and the broken line the 
population of Albany. Total area under each curve = 100%. 

the mortality rates of Glover's Life Table, would result if 100,000 persons 
were born alive uniformly throughout each year {M t calculated from 
L x line of Glover's Table (p. 1.6) = 33.796 years). 

This procedure simply multiplies our former index x 2 by the difference 
(given its proper sign) between the mean age of the observed population 
and the mean age of the standard population on the basis of which x 2 
was calculated. Since in fact the mean age of any actual urban popula- 
tion is never likely to be as great as the mean age of the stationary popula- 
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tion chosen as a standard of reference the actual values of <f> will practically 
always be negative for cities. The smaller these negative values are 
numerically the greater will be the proportion of older persons in the 
population concerned. In short this function <j> tells us not only the de- 
gree to which a given population deviates in its age distribution from a 
fixed standard age distribution, but also the nature of this deviation, 
whether on the one hand in the direction of a relative excess of aged, or 
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Fig. 2. Diagram showing age distribution of population, grouped in six age classes. 
The solid line gives the stationary life-table population, and the broken line the 
population of Atlanta. Total area under each curve = 100%. 



on the other hand in the direction of a relative excess of the young. 
Theoretically it is possible for two populations differing from one another 
in a compensatory way to give the same values for the index #. But 
two populations which differ in age distribution in any fundamental re- 
spect which could affect appreciably crude death rates will, in all popula- 
tions I have been able to test, give different values of <j>, provided the age 
classification from which the function is calculated is finely enough divided. 
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The numerical values of the index 4> for a series of American cities, on 
the basis of the age distribution of the 1910 census are given by way of 
illustration in table 1. The cities are arranged in descending order of de- 
parture from the L x population. 

The extent of the departures from the standard population denoted by 
the several values of the indices may be indicated graphically by plotting 
the age distribution curve, on a percentage basis, for the two extreme cities 
in the table, Albany and Atlanta, against the L x population. This is 
done in figures 1 and 2. 

The general characteristics of an urban population as compared with a 
stationary life table population are well shown in these diagrams. In 
general a city has population under 40 to 45 in excess, and population 
older than that in defect. The reason is obvious. 

TABLE 1 

Showing Age-Constitution Indices op the Population of American Cities 



Albany 

Los Angeles . . 

Oakland 

Washington . . 
Cincinnati 

Dayton 

Indianapolis . . 
Rochester. . . . 
Grand Rapids 
Columbus .... 
Providence . . . 
Cambridge. . . 

Boston 

Baltimore. . . . 

Louisville 

Toledo 

New Haven. . 



-10.73 

- 13.61 

- 17.18 
-20.67 
-22.93 

- 24.10 

- 24.23 

- 26.62 
-27.95 

- 29 . 08 

- 30.05 

- 30.26 
-31.05 

- 31.80 
-32.19 
-33.21 

- 33 . 70 



Lowell 

Philadelphia. . 
San Francisco 

St. Louis 

Nashville .... 

Buffalo 

New Orleans. 
Minneapolis. . 
Milwaukee . . . 

Newark 

Chicago 

St. Paul 

Pittsburgh. . . 
Fall River. . . 
New York . . . 

Cleveland 

Atlanta 



- 34.91 

- 39.51 
-43.35 
-44.57 

- 48.32 
-48.61 
-52.44 
-55.39 

- 62 . 17 

- 62 . 54 
-68.76 

- 68.95 

- 71.77 
-73.39 
-74.42 
-74.51 
-82.71 



III 

It will be well to see in detail how the age distributions of some cities 
having nearly the same value for </> compare in respect of actual age dis- 
tribution of their populations. Table 2 furnishes such comparisons for 
four pairs of cities, covering fairly the whole rank of values of <j> shown in 
table 1. 

From this table it is seen that as we pass from cities having a value of 
4> of about 24 to cities having <j> equal to about 74 there is a steady change 
from populations having relatively many persons in the higher age groups 
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TABLE 2 

Percentage Age Distribution op the Population of Certain Cities in 1910 





DAYTON 


INDIAN- 
APOLIS 


PROVI- 
DENCE 


CAM- 
BRIDGE 


NASH- 
VILLE 


BUFFALO 


NEW- 
YORK 


CLEVE- 
LAND 


AGE 
CLASSES 

(years) 


O 




in 

O 
O 


O 








to 




II 

■9- 


II 
-6- 


II 

■e- 


II 


II 
■e- 


II 
-9- 


II 

■e- 


II 


0- 4 
5-14 


9.1% 
15.4 


8.0% 
15.3 


9.7% 
16.5 


10.3% 

17.5 


9.2% 
17.8 


10.0% 

18.3 


10.6% 
18.1 


11.1% 

17.4 


15-24 


19.5 


19.4 


19.0 


18.4 


21.9 


20.6 


20.8 


20.5 


25-44 


34.6 


35.9 


34.2 


33.3 


32.2 


32.3 


33.9 


34.4 


45-64 


17.0 


17.0 


16.2 


16.0 


15.1 


15.5 


13.7 


13.4 


65 & over 


4.4 
100.0 


4.3 
99.9 


4.2 
99.8 


4.4 
99.9 


3.8 


3.4 
100.1 


2.8 
99.9 


3.0 


Totals 


100.0 


99.8 



to populations having relatively few persons in those groups. Further- 
more, it is evident that in each of the four pairs of cities compared the agree- 
ment between the two cities having nearly identical values of <j> is very 
close in respect of actual percentage distribution of the population. Day- 
ton and Indianapolis were for all practical statistical purposes identical 
in the age distribution of their populations in 1910. So were Providence 
and Cambridge, Nashville and Buffalo, and New York and Cleveland. 
In each case the curve for one distribution of the pair winds in and out 
about the path set by the other. Of course, we should get finer differen- 
tiations brought out by the 4> function if we used 15 or 20 age classes in- 
stead of the 6 here employed. But for the purposes of the investigation 
in which I am using these functions, and for purposes of illustration of 
method 6 classes are sufficient. A word of caution must, however, be 
emphasized here. The reliability of <b as an approximate index of differ- 
ences in age distributions of population is greater as we pass in either 
direction towards the ends of its range of values. In the case of popula- 
tions giving values of <j> near the mean (say in the thirties for American 
cities) it may be necessary in order to get realty differentiant values to 
calculate from a rather fine age grouping. 

In general it is believed that the function of an age distribution of a 
population here proposed will give, in a single numerical expression, a 
substantially accurate indication of the essential nature of that age dis- 
tribution, and will facilitate the differentiation and classification of popula- 
tions in respect of this characteristic for statistical studies, particularly 
by the method of multiple correlation. 
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